TCGCCCCACGCGTCCGCACCGCCGCCCAGGCAAGGCCGCCCTGCCTTGGG

SEQ. ID NO: 1

SEQ.ID NO: 2 — M AGGRGAP 8 
CGCAGCGCTGCC ATG GCT GGG GGC CGT GGG GCC CCC 24 
SEQ.ID NO: 3 
RPDACGPVKEP 52
CGG CCG GAC GCT TGC GGA CCG GTC AAA GAG CCC 156 



PEINLVLYPQG 63 
CCT GAA ATC AAT TTA GTT TTG TAC CCT CAA GGC 189 



LTGEEVYVKVD 74 
CTA ACT GGT GAA GAA GTA TAT GTA AAA GTG GAT 222 



LRVKCPPTYPD 85 
TTG AGG GTT AAA TGC CCA CCT ACC TAT CCA GAT 255 



VVPEIELKNAK 96
GTA GTT CCT GAA ATA GAG TTA AAA AAT GCC AAA 288 



GLSNESVNLLK 107 
GGT CTA TCA AAT GAA AGT GTC AAT TTG TTA AAA 321 



SRLEELAKKHC 118 
TCT CGC CTA GAA GAA CTG GCC AAG AAA CAC TGT 354 



GEVMIFELAYH 129

GGG GAG GTG ATG ATC TTT GAA CTG GCT TAC CAC 387 

VQSFLSEHNKP 140 

GTG CAG TCA TTT CTC AGC GAG CAT AAC AAG CCC 420 
CCT CCC AAG TCT TTT CAT GAA GAA ATG CTG GAA 453



RRAQEEQQRLL 162
AGG CGG GCT CAG GAG GAG CAG CAG AGG CTG TTG 486

EAKRKEEQEQR 173
GAG GCC AAG CGG AAA GAA GAG CAG GAG CAA CGT 519

EILHEIQRRKE 184
GAA ATC CTG CAT GAG ATT CAG AGA AGG AAA GAA 552

EIKEEKKRKEM 195
GAG ATA AAA GAA GAG AAA AAA AGG AAA GAA ATG 585

AKQERLEIASL 206
GCT AAG CAG GAA CGT TTG GAA ATT GCT AGT TTG 618

SNQDHTSKKDP 217
TCA AAC CAA GAT CAT ACC TCT AAG AAG GAC CCA 651

GGHRTAAILHG 228
GGA GGA CAC AGA ACG GCT GCC ATT CTA CAT GGA 684

GSPDFVGNGKH 239
GGC TCT CCT GAC TTT GTA GGA AAT GGT AAA CAT 717

RANSSGRSRRE 250
CGG GCA AAC TCC TCA GGA AGG TCT AGG CGA GAA 750

RQYSVCNSEDS 261
CGT CAG TAT TCT GTA TGT AAT AGT GAA GAT TCT 783

PGSCEILYFNM 272
CCT GGC TCT TGT GAA ATT CTG TAT TTC AAT ATG 816

G S P D Q L
GGG AGT CCT GAT CAG CTC

FIG. 
KCIGSDEQLGK 294

AAA TGT ATT GGC AGT GAT GAA CAA CTT GGA AAA 882 

LVYNALETATG 305 

TTA GTC TAC AAT GCT TTG GAA ACA GCC ACT GGT 915 

GFVLLYEWVLQ 316 

GGC TTT GTC TTG TTG TAT GAG TGG GTC CTT CAG 948 

WQKKMGPFLTS 327 

TGG CAG AAA AAA ATG GGT CCA TTC CTT ACC AGT 981 

QEKEKIDKCKK 338 

CAA GAA AAA GAG AAG ATT GAT AAG TGC AAA AAG 1014 

QIQGTETEFNS 349 

CAG ATT CAA GGA ACA GAA ACA GAA TTC AAC TCA 1047 

LVKLSHPNVVR 360 

CTG GTA AAA TTG AGC CAT CCA AAT GTA GTA CGC 1080 

YLAMNLKEQDD 371 

TAC CTT GCA ATG AAT CTC AAA GAG CAA GAC GAC 1113 

SIVVDILVEHI 382 

TCC ATC GTG GTG GAC ATT TTA GTG GAG CAC ATT 1146

SGVSLAAHLSH 393 

AGT GGG GTC TCT CTT GCT GCA CAC CTG AGC CAC 1179 

SGPIPVHQLRR 404 

TCA GGC CCC ATC CCT GTG CAT CAG CTT CGC AGG 1212 

YTAQLLSGLDY 415 

TAC ACA GCT CAG CTC CTG TCA GGC CTT GAT TAT 1245 

LHSNSVVHKVL 426 

CTG CAC AGC AAT TCT GTG GTG CAT AAG GTC CTG 1278 
SASNVLVDAEG
AGT GCA TCT AAT GTC TTG GTG GAT GCA GAA GGC 

TVKITDYSISK 
ACC GTC AAG ATT ACG GAC TAT AGC ATT TCT AAG 

RLADICKEDVF
CGC CTC GCA GAC ATT TGC AAG GAG GAT GTG TTT 

EQTRVRFSDNA 
GAG CAA ACC CGA GTT CGT TTT AGT GAC AAT GCT 

LPYKTGKKGDV 
CTG CCT TAT AAA ACG GGG AAG AAA GGA GAT GTT 

WRLGLLLLSLS 
TGG CGT CTT GGC CTT CTG CTG CTG TCC CTC AGC 

QGQECGEYPVT 
CAA GGA CAG GAA TGT GGA GAG TAC CCT GTG ACC 

IPSDLPADFQD 
ATC CCT AGT GAC TTA CCA GCT GAC TTT CAA GAT 

FLKKCVCLDDK 
TTT CTA AAG AAA TGT GTG TGC TTG GAT GAC AAG 

ERWSPQQLLKH 
GAA AGA TGG AGT CCC CAG CAG TTG TTG AAA CAC 

SFINPQPKMPL 
AGC TTT ATA AAT CCC CAG CCA AAA ATG CCT CTA 

VEQSPEDSGGQ 
GTG GAA CAA AGT CCT GAA GAT TCT GGA GGA CAA 

DYVETVIPSNR 
GAT TAT GTT GAG ACT GTT ATT CCT AGC AAC CGG 

LPSAAFFSETQ 
CTA CCC AGT GCT GCC TTC TTT AGT GAG ACA CAG 
AVIKVQNKLDG 613
GCT GTC ATC AAG GTG CAG AAC AAG TTG GAC GGC 1839 






CCYAVKRIPIN 624 
TGC TGC TAC GCA GTG AAG CGC ATC CCC ATC AAC 1872 



PASRQFRRIKG 635

CCG GCC AGC CGG CAG TTC CGC AGG ATC AAG GGC 1905 

EVTLLSRLHHE 646 

GAA GTG ACA CTG CTG TCA CGG CTG CAC CAT GAG 1938 

NIVRYYNAWIE 657 

AAC ATT GTG CGC TAC TAC AAC GCC TGG ATC GAG 1971 

RHERPAGPGTP 668 

CGG CAC GAG CGG CCG GCG GGA CCG GGG ACG CCG 2004 

PPDSGPLAKDD 679 

CCC CCG GAC TCC GGG CCC CTG GCC AAG GAT GAC 2037 

RAARGQPASDT 690 

CGA GCT GCA CGC GGG CAG CCG GCG AGC GAC ACA 2070

DGLDSVEAAAP 701

GAC GGC CTG GAC AGC GTA GAG GCC GCC GCG CCG 2103

PPILSSSVEWS 712

CCA CCC ATC CTC AGC AGC TCG GTG GAG TGG AGC 2136



TSGERSASARF 723 
ACT TCG GGC GAG CGC TCG GCC AGT GCC CGT TTC 2169 
PATGPGSSDDE 734
CCC GCC ACC GGC CCG GGC TCC AGC GAT GAC GAG 2202

DDDEDEHGGVF 745
GAC GAC GAC GAG GAC GAG CAC GGT GGC GTC TTC 2235

SQSFLPASDSE 756
TCC CAG TCC TTC CTG CCT GCT TCA GAT TCT GAA 2268

SDIIFDNEDEN 767
AGT GAT ATT ATC TTT GAC AAT GAA GAT GAG AAC 2301

SKSQNQDEDCN 778
AGT AAA AGT CAG AAT CAG GAT GAA GAT TCC AAT 2334

EKNGCHESEPS 789
GAA AAG AAT GGC TGC CAT GAA AGT GAG CCA TCA 2367

VTTEAVHYLYI 800
GTG ACG ACT GAG GCT GTG CAC TAC CTA TAC ATC 2400

QMEYCEKSTLR 811
CAG ATG GAG TAC TGT GAG AAG AGC ACT TTA CGA 2433

DTIDQGLYRDT 822
GAC ACC ATT GAC CAG GGA CTG TAT CGA GAC ACC 2466

VRLWRLFREIL 833
GTC AGA CTC TGG AGG CTT TTT CGA GAG ATT CTG 2499



DGLAYIHEKGM 844 

GAT GGA TTA GCT TAT ATC CAT GAG AAA GGA ATG 2532 

IHRDLKPVNIF 855 

ATT CAC CGG GAT TTG AAG CCT GTC AAC ATT TTT 2565 

LDSDDHVKIGD 866 

TTG GAT TCT GAT GAC CAT GTG AAA ATA GGT GAT 2598 

FIG. 1F 




FGLATDHLAFS 877
TTT GGT TTG GCG ACA GAC CAT CTA GCC TTT TCT 2631

ADSKQDDQTGD 888
GCT GAC AGC AAA CAA GAC GAT CAG ACA GGA GAC 2664

LIKSDPSGHLT 899
TTG ATT AAG TCA GAC CCT TCA GGT CAC TTA ACT 2697

GMVGTALYVSP 910
GGG ATG GTT GGC ACT GCT CTC TAT GTA AGC CCA 2730

EVQGSTKSAYN 921
GAG GTC CAA GGA AGC ACC AAA TCT GCA TAC AAC 2763

QKVDLFSLGII 932
CAG AAA GTG GAT CTC TTC AGC CTG GGA ATT ATC 2796

FFEMSYHPMVT 943
TTC TTT GAG ATG TCC TAT CAC CCC ATG GTC ACG 2829

ASERIFVLNQL 954
GCT TCA GAA AGG ATC TTT GTT CTC AAC CAA CTC 2862

RDPTSPKFPED 965
AGA GAT CCC ACT TCG CCT AAG TTT CCA GAA GAC 2895

FDDGEHAKQKS 976
TTT GAC GAT GGA GAG CAT GCA AAG CAG AAA TCA 2928

VISWLLNHDPA 987
GTC ATC TCC TGG CTG TTG AAC CAC GAT CCA GCA 2961

KRPTATELLKS 998
AAA CGG CCC ACA GCC ACA GAA CTG CTC AAG AGT 2994

ELLPPPQMEES 1009
GAG CTG CTG CCC CCA CCC CAG ATG GAG GAG TCA 3027

ELHEVLHHTLT 1020
GAG CTG CAT GAA GTG CTG CAC CAC ACG CTG ACC 3060
NVDGKAYRTMM 1031
AAC GTG GAT GGG AAG GCC TAC CGC ACC ATG ATG 3093 






AQIFSQRISPA 1042 

GCC CAG ATC TTC TCG CAG CGC ATC TCC CCT GCC 3126

IDYTYDSDILK 1053 

ATC GAT TAC ACC TAT GAC AGC GAC ATA CTG AAG 3159 



GNFSIRTAKMQ 1064 

GGC AAC TTC TCA ATC CGT ACA GCC AAG ATG CAG 3192 

QHVCETIIRIF 1075 

CAG CAT GTG TGT GAA ACC ATC ATC CGC ATC TTT 3225 

KRHGAVQLCTP 1086 

AAA AGA CAT GGA GCT GTT CAG TTG TGT ACT CCA 3258 

LLLPRNRQIYE 1097 

CTA CTG CTT CCC CGA AAC AGA CAA ATA TAT GAG 3291

HNEAALFMDHS 1108 

CAC AAC GAA GCT GCC CTA TTC ATG GAC CAC AGC 3324 

GMLVMLPFDLR 1119 

GGG ATG CTG GTG ATG CTT CCT TTT GAC CTG CGG 3357 

IPFARYVARNN 1130

ATC CCT TTT GCA AGA TAT GTG GCA AGA AAT AAT 3390 

ILNLKRYCIER 1141 

ATA TTG AAT TTA AAA CGA TAC TGC ATA GAA CGT 3423 

VFRPRKLDRFH 1152 

GTG TTC AGG CCG CGC AAG TTA GAT CGA TTT CAT 3456

PKELLECAFDI 1163
CCC AAA GAA CTT CTG GAG TGT GCA TTT GAT ATT 3489

VTSTTNSFLPT 1174
GTC ACT TCT ACC ACC AAC AGC TTT CTG CCC ACT 3522




AEIIYTIYEII 1185 

GCT GAA ATT ATC TAC ACT ATC TAT GAA ATC ATC 3555 

QEFPALQERNY 1196 

CAA GAG TTT CCA GCA CTT CAG GAA AGA AAT TAC 3588 

SIYLNHTMLLK 1207 

AGT ATT TAT TTG AAC CAT ACC ATG TTA TTG AAA 3621 

AILLHCGIPED 1218

GCA ATA CTC TTA CAC TGT GGG ATC CCA GAA GAT 3654 



KLSQVYIILYD 1229
AAA CTC AGT CAA GTC TAC ATT ATT CTG TAT GAT 3687

AVTEKLTRREV 1240
GCT GTG ACA GAG AAG CTG ACG AGG AGA GAA GTG 3720

EAKFCNLSLSS 1251
GAA GCT AAA TTT TGT AAT CTG TCT TTG TCT TCT 3753

NSLCRLYKFIE 1262
AAT AGT CTG TGT CGA CTC TAC AAG TTT ATT GAA 3786

QKGDLQDLMPT 1273
CAG AAG GGA GAT TTG CAA GAT CTT ATG CCA ACA 3819

INSLIKQKTGI 1284
ATA AAT TCA TTA ATA AAA CAG AAA ACA GGT ATT 3852

AQLVKYGLKDL 1295
GCA CAG TTG GTG AAG TAT GGC TTA AAA GAC CTA 3885



EEVVGLLKKLG 1306 
GAG GAG GTT GTT GGA CTG TTG AAG AAA CTC GGC 3918 

FIG. 1I
IKLQVLINLGL 1317

ATC AAG TTA CAG GTC TTG ATC AAT TTG GGC TTG 3951 

VYKVQQHNGII 1328

GTT TAC AAG GTG CAG CAG CAC AAT GGA ATC ATC 3984 






FQFVAFIKRRQ 1339 

TTC CAG TTT GTG GCT TTC ATC AAA CGA AGG CAA 4017 

RAVPEILAAGG 1350

AGG GCT GTA CCT GAA ATC CTC GCA GCT GGA GGC 4050



RYDLLIPQFRG 1361 
AGA TAT GAC CTG CTG ATT CCC CAG TTT AGA GGG 4083 



PQALGPVPTAI 1372 
CCA CAA GCT CTG GGG CCA GTT CCC ACT GCC ATT 4116 



GVSIAIDKISA 1383 

GGG GTC AGC ATA GCT ATA GAC AAG ATA TCT GCT 4149

AVLNMEESVTI 1394

GCT GTC CTC AAC ATG GAG GAA TCT GTT ACA ATA 4182 

SSCDLLVVSVG 1405 

AGC TCT TGT GAC CTC CTG GTT GTA AGT GTT GGT 4215 

QMSMSRAINLT 1416 

CAG ATG TCT ATG TCC AGG GCC ATC AAC CTA ACC 4248 

QKLWTAGITAE 1427 

CAG AAA CTC TGG ACA GCA GGC ATC ACA GCA GAA 4281 

IMYDWSQSQEE 1438 

ATC ATG TAC GAC TGG TCA CAG TCC CAA GAG GAA 4314 

LQEYCRHHEIT 1449

TTA CAA GAG TAC TGC AGA CAT CAT GAA ATC ACC 4347 

YVALVSDKEGS 1460 

TAT GTG GCC CTT GTC TCG GAT AAA GAA GGA AGC 4380 
HVKVKSFEKER 1471

CAT GTC AAG GTT AAG TCT TTC GAG AAG GAA AGG 4413 

QTEKRVLETEL 1482 

CAG ACA GAG AAG CGT GTG CTG GAG ACT GAA CTT 4446

VDHVLQKLRTK 1493

GTG GAC CAT GTA CTG CAG AAA CTG AGG ACT AAA 4479

VTDERNGREAS 1504 

GTC ACT GAT GAA AGG AAT GGC AGA GAA GCT TCC 4512 

DNLAVQNLKGS 1515 

GAT AAT CTT GCA GTG CAA AAT CTG AAG GGG TCA 4545 

FSNASGLFEIH 1526 

TTT TCT AAT GCT TCA GGT TTG TTT GAA ATC CAT 4578 

GATVVPIVSVL 1537 

GGA GCA ACA GTG GTT CCC ATT GTG AGT GTG CTA 4611

APEKLSASTRR 1548

GCC CCG GAG AAG CTG TCA GCC AGC ACT AGG AGG 4644

RYETQVQTRLQ 1559

CGC TAT GAA ACT CAG GTA CAA ACT CGA CTT CAG 4677

TSLANLHQKSS 1570 

ACC TCC CTT GCC AAC TTA CAT CAG AAA AGC AGT 4710 

EIEILAVDLPK 1581 

GAA ATT GAA ATT CTG GCT GTG GAT CTA CCC AAA 4743 

ETILQFLSLEW 1592 

GAA ACA ATA TTA CAG TTT TTA TCA TTA GAG TGG 4776

DADEQAFNTTV 1603

GAT GCT GAT GAA CAG GCA TTT AAC ACA ACT GTG 4809

FIG. 1K 
KQLLSRLPKQR 1614

AAG CAG CTG CTG TCA CGC CTG CCA AAG CAA AGA 4842 

YLKLVCDEIYN 1625 

TAC CTC AAA TTA GTC TGT GAT GAA ATT TAT AAC 4875 

IKVEKKVSVLF 1636

ATC AAA GTA GAA AAA AAG GTG TCT GTG CTA TTT 4908

LYSYRDDYYRI 1647

CTG TAC AGC TAT AGA GAT GAC TAC TAC AGA ATC 4941



CCCTAAAGAACTGTCGTTAACCTCATTCAAACAGACAGAGGCTTATACTG 
GAATAATGGAATGTTGTACATTCATCATAATTTAAAATTAAATTCTAAGA 
AGAGGCTGGGTGCAGTGGCTCACACCTTTAATCCCAGCACTTTGGGAAGC 
CAAGGCAGGAAGACTGCTTGAAACCAGGAGTTTGAGACCAGCCTGAGCAA 
CAAAGCAAGACCCCATCTCTATAAAAACTAAAAAAATTAGTTGGGCATGG 
TGGCACATGCCTGTAGTCCCAGCTACTCCAGAGGCTGAGATGGATCATCT 
GAGCCTCAGGAGGTTGAGGCTGCAGTGAGCTGTGACTGCGCCACTGCACT 
CCAGTCTGGGACAACAGAGCAAGACCCTGTCTTAAAAAAAAAAAGAAAAA 
AAAATTTTTTTTCTAAGAAGCTGTCCTACAAAGTTGAGCTTTGTTAGTTT 
TTCATGTGTAATATATTATAAATTTATCTTTTGGGATATAATAAATGCTT 
TCATATACCTGCA 

FIG. 1L 
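
The rows of FIG. 1 pair an amino-acid line with its codon line, each ending in a running counter. A minimal sketch of how one row can be checked, assuming the standard genetic code and assuming (as the legible counters suggest, e.g. 63 with 189) that the nucleotide counter is three times the residue counter. The helper names `translate` and `row_is_consistent` are illustrative and do not come from the figures.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(codon_row: str) -> str:
    """Translate a space-separated row of DNA codons into one-letter residues."""
    return "".join(CODON_TABLE[codon] for codon in codon_row.split())


def row_is_consistent(residues: str, aa_end: int, codon_row: str, nt_end: int) -> bool:
    """Check that the codons encode the residues and that nt_end == 3 * aa_end."""
    return translate(codon_row) == residues and nt_end == 3 * aa_end


# Two rows copied from FIG. 1 (residues ending at 63 and at 74).
ROW_63 = ("PEINLVLYPQG", 63, "CCT GAA ATC AAT TTA GTT TTG TAC CCT CAA GGC", 189)
ROW_74 = ("LTGEEVYVKVD", 74, "CTA ACT GGT GAA GAA GTA TAT GTA AAA GTG GAT", 222)
```

A row that fails this check points to a misread codon or a misread counter in that row.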

6065914_eIF2kinase_man
14790 MAGGRGAPGRGRDEPPESYPQRQDH
6066585_eIF2kinase_mouse MAGGRGASGRGRAEPQESYSQRQDH

6065914_eIF2kinase_man
14790 ELQALEAIYGADFQDLRPDACGPVK
6066585_eIF2kinase_mouse ELQALEAIYGSDFQDLRPDARGRVR

6065914_eIF2kinase_man
14790 EPPEINLVLYPQGLTGEEVYVKVDL
6066585_eIF2kinase_mouse EPPEINLVLYPQGLAGEEVYVQVEL
6065914_eIF2kinase_man
14790 RVKCPPTYPDVVPEIELKNAKGLSN
6066585_eIF2kinase_mouse QVKCPPTYPDVVPEIELKNAKGLSN

6065914_eIF2kinase_man
14790 ESVNLLKSRLEELAKKHCGEVMIFE
6066585_eIF2kinase_mouse ESVNLLKSHLEELAKKQCGEVMIFE

6065914_eIF2kinase_man
14790 LAYHVQSFLSEHNKPPPKSFHEEML
6066585_eIF2kinase_mouse LAHHVQSFLSEHNKPPPKSFHEEML

6065914_eIF2kinase_man
14790 ERRAQEEQQRLLEAKRKEEQEQREI
6066585_eIF2kinase_mouse ERQAQEKQQRLLEARRKEEQEQREI

6065914_eIF2kinase_man
14790 LHEIQRRKEEIKEEKKRKEMAKQER
6066585_eIF2kinase_mouse LHEIQRRKEEIKEEKKRKEMAKQER

6065914_eIF2kinase_man
14790 LEIASLSNQDHTSKKDPGGHRTAAI
6066585_eIF2kinase_mouse LEITSLTNQDYASKRDPAGHRAAAI

6065914_eIF2kinase_man
14790 LHGGSPDFVGNGKHRANSSGRSRRE
6066585_eIF2kinase_mouse LHGGSPDFVGNGKARTYSSGRSRRE

6065914_eIF2kinase_man
14790 RQYSVCNSEDSPGSCEILYFNMGSP
6066585_eIF2kinase_mouse RQYSVCSGEPSPGSCDILHFSVGSP

6065914_eIF2kinase_man
14790 DQLMVHKGKCIGSDEQLGKLVYNAL
6066585_eIF2kinase_mouse DQLMVHKGRCVGSDEQLGKVVYNAL

6065914_eIF2kinase_man
14790 ETATGGFVLLYEWVLQWQKKMGPFL
6066585_eIF2kinase_mouse ETATGSFVLLHEWVLQWQK-MGPCL
6065914_eIF2kinase_man
14790 TSQEKEKIDKCKKQIQGTETEFNSL
6066585_eIF2kinase_mouse TSQEKEKIDKCKRQIQGAETEFSSL

6065914_eIF2kinase_man
14790 VKLSHPNVVRYLAMNLKEQDDSIVV
6066585_eIF2kinase_mouse VKLSHPNIVRYFAMNSREEEDSIVI

6065914_eIF2kinase_man
14790 DILVEHISGVSLAAHLSHSGPIPVH
6066585_eIF2kinase_mouse DILAEHVSGISLATHLSHSGPVPAH

6065914_eIF2kinase_man
14790 QLRRYTAQLLSGLDYLHSNSVVHKV
6066585_eIF2kinase_mouse QLRKYTAQLLAGLDYLHSNSVVHKV

6065914_eIF2kinase_man
14790 LSASNVLVDAEGTVKITDYSISKRL
6066585_eIF2kinase_mouse LSASSVLVDAEGTVKITDYSISKRL

6065914_eIF2kinase_man
14790 ADICKEDVFEQTRVRFSDNALPYKT
6066585_eIF2kinase_mouse ADICKEDVFEQARVRFSDSALPYKT

6065914_eIF2kinase_man
14790 GKKGDVWRLGLLLLSLSQGQECGEY
6066585_eIF2kinase_mouse GKKGDVWRLGLLLLSLSQGQECGEY

6065914_eIF2kinase_man
14790 PVTIPSDLPADFQDFLKKCVCLDDK
6066585_eIF2kinase_mouse PVTIPSDLPADFQDFLKKCVCLDDK

6065914_eIF2kinase_man
14790 ERWSPQQLLKHSFINPQPKMPLVEQ
6066585_eIF2kinase_mouse ERWSPQQLLKHSFINPQPKLPLVEQ

6065914_eIF2kinase_man
14790 SPEDSGGQDYVETVIPSNRLPSAAF
6066585_eIF2kinase_mouse SPEDSGGQDYIETVIPSNQLPSAAF

6065914_eIF2kinase_man
14790 FSETQRQFSRYFIEFEELQLLGKGA
6066585_eIF2kinase_mouse FSETQRQFSRYFIEFEELQLLGKGA

6065914_eIF2kinase_man
14790 FGAVIKVQNKLDGCCYAVKRIPINP
6066585_eIF2kinase_mouse FGAVIKVQNKLDGCCYAVKRIPINP

6065914_eIF2kinase_man
14790 ASRQFRRIKGEVTLLSRLHHENIVR
6066585_eIF2kinase_mouse ASRHFRRIKGEVTLLSRLHHENIVR

6065914_eIF2kinase_man
14790 YYNAWIERHERPAGPGTPPPDSGPL
6066585_eIF2kinase_mouse YYNAWIERHERPAVPGTPPPDCTPQ

6065914_eIF2kinase_man
14790 AKDDRAARGQPASDTDGLDSVEAAA
6066585_eIF2kinase_mouse AQDSPATCGKTSGDTEELGSVEAAA

6065914_eIF2kinase_man
14790 PPPILSSSVEWSTSGERSASARFPA
6066585_eIF2kinase_mouse PPPILSSSVEWSTSAERSTSTRFPV

6065914_eIF2kinase_man
14790 TGPGSSDDEDDDEDEHGGVFSQSFL
6066585_eIF2kinase_mouse TGQDSSSDEED-EDERDGVFSQSFL

6065914_eIF2kinase_man
14790 PASDSESDIIFDNEDENSKSQNQDE
6066585_eIF2kinase_mouse PASDSDSDIIFDNEDENSKSQNQDE

6065914_eIF2kinase_man
14790 DCNEKNGCHESEPSVTTEAVHYLYI
6066585_eIF2kinase_mouse DCNQKDGSHEIEPSVTAEAVHYLYI
6065914_eIF2kinase_man
14790 QMEYCEKSTLRDTIDQGLYRDTVRL
6066585_eIF2kinase_mouse QMEYCEKSTLRDTIDQGLFRDTSRL

6065914_eIF2kinase_man
14790 WRLFREILDGLAYIHEKGMIHRDLK
6066585_eIF2kinase_mouse WRLFREILDGLAYIHEKGMIHRDLK

6065914_eIF2kinase_man
14790 PVNIFLDSDDHVKIGDFGLATDHLA
6066585_eIF2kinase_mouse PVNIFLDSDDHVKIGDFGLATDHLA

6065914_eIF2kinase_man
14790 FSADSKQDDQTGD-LIKSDPSGHLT
6066585_eIF2kinase_mouse FTAEGKQDDQAGDGVIKSDPSGHLT

6065914_eIF2kinase_man
14790 GMVGTALYVSPEVQGSTKSAYNQKV
6066585_eIF2kinase_mouse GMVGTALYVSPEVQGSTKSAYNQKV

6065914_eIF2kinase_man
14790 DLFSLGIIFFEMSYHPMVTASERIF
6066585_eIF2kinase_mouse DLFSLGIIFFEMSYHPMVTASERIF

6065914_eIF2kinase_man
14790 VLNQLRDPTSPKFPEDFDDGEHAKQ
6066585_eIF2kinase_mouse VLNQLRDPTSPKFPDDFDDGEHTKQ

6065914_eIF2kinase_man
14790 KSVISWLLNHDPAKRPTATELLKSE
6066585_eIF2kinase_mouse KSVISWLLNHDPAKRPTAMELLKSE



FIG. 2E 
6065914_eIF2kinase_man
14790 LLPPPQMEESELHEVLHHTLTNVDG
6066585_eIF2kinase_mouse LLPPPQMEESELHEVLHHTLANIDG

6065914_eIF2kinase_man
14790 KAYRTMMAQIFSQRISPAIDYTYDS
6066585_eIF2kinase_mouse KAYRTMMSQIFCQHISPAIDYTYDS

6065914_eIF2kinase_man
14790 DILKGNFSIRTAKMQQHVCETIIRI
6066585_eIF2kinase_mouse DILKGNFLIRTAKIQQLVCETIVRV

6065914_eIF2kinase_man
14790 FKRHGAVQLCTPLLLPRNRQIYEHN
6066585_eIF2kinase_mouse FKRHGAVQLCTPLLLPRNRQIYEHN



6065914_eIF2kinase_man 
14790 

6066585 eIF2kinase mouse 



— ALFMDHSGMLVMLPFDLRIPFAR 
EAALFMDHSGMLVMLPFDLRIPFAR 
EAALFMDHSGMLVMLPFDLRVPFAR 

****************** ; **** 



6065914_eIF2kinase_man YVARNNILILKRYCIERVFRPRKLD
14790 YVARNNILNLKRYCIERVFRPRKLD
6066585_eIF2kinase_mouse YVARNNILNLKRYCIERVFRPRKLD
******** ****************

6065914_eIF2kinase_man RFHPKELLECAFDIVTSTTNSFLPT
14790 RFHPKELLECAFDIVTSTTNSFLPT
6066585_eIF2kinase_mouse RFHPKELLECAFDIVTSTTNSSLPT
********************* ***



6065914_eIF2kinase_man AEIIYTIYEIIQEFPALQERNYSIY 

14790 AEIIYTIYEIIQEFPALQERNYSIY 

6066585_eIF2kinase_mouse AETIYTIYEIIQEFPALQERNYSIY 

** ********************** 



FIG. 2F 
6065914_eIF2kinase_man LNHTMLLKAILLHCGIPEDKLSQVY
14790 LNHTMLLKAILLHCGIPEDKLSQVY
6066585_eIF2kinase_mouse LNHTMLLKAILLHCGIPEDKLSQVY
*************************

6065914_eIF2kinase_man IILYDAVTEKLTRREVEAKFCNLSL
14790 IILYDAVTEKLTRREVEAKFCNLSL
6066585_eIF2kinase_mouse VILYDAVTEKLTRREVEAKFCNLSL
.************************

6065914_eIF2kinase_man SSNSLCRLYKFIEQKGDLQDLMPTI
14790 SSNSLCRLYKFIEQKGDLQDLMPTI
6066585_eIF2kinase_mouse SSNSLCRLYKFIEQKGDLQDLTPTI
********************* ***

6065914_eIF2kinase_man NSLIKQKTGIAQLVKYGLKDLEEVV
14790 NSLIKQKTGIAQLVKYGLKDLEEVV
6066585_eIF2kinase_mouse NSLIKQKTGVAQLVKYSLKDLEDVV
********

6065914_eIF2kinase_man GLLKKLGIKLQVLINLGLVYKVQQH
14790 GLLKKLGIKLQVLINLGLVYKVQQH
6066585_eIF2kinase_mouse GLLKKLGVKLQVSINLGLVYKVQQH
*******.**** ******** ***

6065914_eIF2kinase_man NGIIFQFVAFIKRRQRAVPEILAAG
14790 NGIIFQFVAFIKRRQRAVPEILAAG
6066585_eIF2kinase_mouse TGIIFQFLAFSKRRQRVVPEILAAG
******.** ***** ********

6065914_eIF2kinase_man GRYDLLIPQFRGPQALGPVPTAIGV
14790 GRYDLLIPQFRGPQALGPVPTAIGV
6066585_eIF2kinase_mouse GRYDLLIPKFRGPQTVGPVPTAVGV
********.*****• .******.**

6065914_eIF2kinase_man SIAIDKISAAVLNMEESVTISSCDL
14790 SIAIDKISAAVLNMEESVTISSCDL
6066585_eIF2kinase_mouse SIAIDKIFAVVLNMEEPVTVSSCDL
******* * ****** **.*****

FIG. 2G 
6065914_eIF2kinase_man LVVSVGQMSMSRAINLTQKLWTAGI
14790 LVVSVGQMSMSRAINLTQKLWTAGI
6066585_eIF2kinase_mouse LVVSVGQMSMSRAINLTQKLWTAGI
*************************

6065914_eIF2kinase_man TAEIMYDWSQSQEELQEYCRHHEIT
14790 TAEIMYDWSQSQEELQEYCRHHEIT
6066585_eIF2kinase_mouse TAEIMYDWSQSQEELQEYCRHHEIT
*************************

6065914_eIF2kinase_man YVALVSDKEGSHVKVKSFEKERQTE
14790 YVALVSDKEGSHVKVKSFEKERQTE
6066585_eIF2kinase_mouse YVALVSDKEGSHVKVKSFEKERQTE
*************************

6065914_eIF2kinase_man KRVLETELVDHVLQKLRTKVTDERN
14790 KRVLETELVDHVLQKLRTKVTDERN
6066585_eIF2kinase_mouse KRVLESDLVDHVMQKLRTKVGDERN
*****..*****•******* ****



6065914_eIF2kinase_man GREASDNLAVQNLKGSFSNASGLFE
14790 GREASDNLAVQNLKGSFSNASGLFE
6066585_eIF2kinase_mouse FRDASDNLAVQTLKGSFSNASGLFE

6065914_eIF2kinase_man IHGATVVPIVSVLAPEKLSASTRRR
14790 IHGATVVPIVSVLAPEKLSASTRRR
6066585_eIF2kinase_mouse IHGTTVVPNVIVLAPEKLSASTRRR

6065914_eIF2kinase_man YETQVQTRLQTSLANLHQKSSEIEI
14790 YETQVQTRLQTSLANLHQKSSEIEI
6066585_eIF2kinase_mouse HEIQVQTRLQTTLANLHQKSSEIEI
.* ********.*************



FIG. 2H 
6065914_eIF2kinase_man LAVDLPKETILQFLSLEWDADEQAF
14790 LAVDLPKETILQFLSLEWDADEQAF
6066585_eIF2kinase_mouse LAVDLPKETILQFLSLEWDADEQAF

6065914_eIF2kinase_man NTTVKQLLSRLPKQRYLKLVCDEIY
14790 NTTVKQLLSRLPKQRYLKLVCDEIY
6066585_eIF2kinase_mouse NTTVKQLLSRLPKQRYLKLVCDEIY
*************************

6065914_eIF2kinase_man NIKVEKKVSVLFLYSYRDDYYRILF
14790 NIKVEKKVSVLFLYSYRDDYYRILF
SEQ. ID NO: 4
6066585_eIF2kinase_mouse NIKVEKKVSVLFLYSYRDDYYRILF
SEQ. ID NO: 5
*************************



FIG. 2I
Analysis of 14790 (1649 aa)
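
The asterisk rows of FIG. 2 flag alignment columns where the sequences agree. A minimal sketch of that bookkeeping for two aligned rows, assuming a plain identity test: a column counts only when both residues are equal and neither is a gap. The ":" and "." marks for conservative substitutions are not reproduced here, and the function names are illustrative.

```python
def identity_marks(row_a: str, row_b: str) -> str:
    """Return '*' for identical, non-gap columns and ' ' for every other column."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    return "".join(
        "*" if a == b and a != "-" else " " for a, b in zip(row_a, row_b)
    )


def percent_identity(row_a: str, row_b: str) -> float:
    """Percentage of alignment columns that are identical and ungapped."""
    marks = identity_marks(row_a, row_b)
    return 100.0 * marks.count("*") / len(marks)


# One 25-column block from FIG. 2 (14790 row and mouse row), including a gap.
HUMAN_BLOCK = "FSADSKQDDQTGD-LIKSDPSGHLT"
MOUSE_BLOCK = "FTAEGKQDDQAGDGVIKSDPSGHLT"
```

Applied block by block and summed over all columns, the same count gives an overall identity figure for the two sequences.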



PFAM 



serkinjpkinase 



pkinase 




[Domain map: residue position axis, ticked every 40 residues from 1 to 681]



>14790 

MAGGRGAPGRGRDEPPESYPQRQDHELQALEAIYGADFQDLRPDACGPV 
KEPPEINLVLYPQGLTGEEVYVKVDLRVKCPPTYPDVVPEIELKNAKGL 
SNESVNLLKSRLEELAKKHCGEVMIFELAYHVQSFLSEHNKPPPKSFHE 
EMLERRAQEEQQRLLEAKRKEEQEQREILHEIQRRKEEIKEEKKRKEMA
KQERLEIASLSNQDHTSKKDPGGHRTAAILHGGSPDFVGNGKHRANSSG 
RSRRERQYSVCNSEDSPGSCEILYFNMGSPDQLMVHKGKCIGSDEQLGK 
LVYNALETATGGFVLLYEWVLQWQKKMGPFLTSQEKEKIDKCKKQIQGT 
ETEFNSLVKLSHPNVVRYLAMNLKEQDDSIVVDILVEHISGVSLAAHLS 
HSGPIPVHQLRRYTAQLLSGLDYLHSNSVVHKVLSASNVLVDAEGTVKI 
TDYSISKRLADICKEDVFEQTRVRFSDNALPYKTGKKGDVWRLGLLLLS 
LSQGQECGEY 



FIG. 7A
SPEDSGGQDYVETVIPSNRLPSAAFFSETQRQFSRYFIEFEELQLLGKGA
FGAVIKVQNKLDGCCYAVKRIPINPASRQFRRIKGEVTLLSRLHHENIVR 
YYNAWIERHERPAGPGTPPPDSGPLAKDDRAARGQPASDTDGLDSVEAAA 
PPPILSSSVEWSTSGERSASARFPATGPGSSDDEDDDEDEHGGVFSQSFL 
PASDSESDIIFDNEDENSKSQNQDEDCNEKNGCHESEPSVTTEAVHYLYI
QMEYCEKSTLRDTIDQGLYRDTVRLWRLFREILDGLAYIHEKGMIHRDLK
PVNIFLDSDDHVKIGDFGLATDHLAFSADSKQDDQTGDLIKSDPSGHLTG
MVGTALYVSPEVQGSTKSAYNQKVDLFSLGIIFFEMSYHPMVTASERIFV
LNQLRDPTSPKFPEDFDDGEHAKQKSVISWLLNHDPAKRPTATELLKSEL
LPPPQMEESELHEVLHHTLTNVDGKAYRTMMAQIFSQRISPAIDYTYDSD
ILKGNFSIRTAKMQQHVCETIIRIFKRHGAVQLCTPLLLPRNRQIYEHNE
AALFMDHSGMLVMLPFDLRIPFARYVARNNILNLKRYCIERVFRPRKLDR 
FHPKELLECAFDIVTSTTNSFLPTAEIIYTIYEIIQEFPALQERNYSIYL 
NHTMLLKAILLHCGIPEDKLSQVYIILYDAVTEKLTRREVEAKFCNLSLS 
SNSLCRLYKFIEQKGDLQDLMPTINSLIKQKTGIAQLVKYGLKDLEEVVG 
LLKKLGIKLQVLINLGLVYKVQQHNGIIFQFVAFIKRRQRAVPEILAAGG
RYDLLIPQFRGPQALGPVPTAIGVSIAIDKISAAVLNMEESVTISSCDLL 
VVSVGQMSMSRAINLTQKLWTAGITAEIMYDWSQSQEELQEYCRHHEITY 
VALVSDKEGSHVKVKSFEKERQTEKRVLETELVDHVLQKLRTKVTDERNG 
REASDNLAVQNLKGSFSNASGLFEIHGATVVPIVSVLAPEKLSASTRRRY 
ETQVQTRLQTSLANLHQKSSEIEILAVDLPKETILQFLSLEWDADEQAFN 
TTVKQLLSRLPKQRYLKLVCDEIYNIKVEKKVSVLFLYSYRDDYYRILF 



PSORT Prediction of Protein Localization

MITDISC: discrimination of mitochondrial targeting seq

R content: 3 Hyd Moment(75): 7.37
Hyd Moment(95): 6.02 G content: 5
D/E content: 2 S/T content: 0

Gavel: prediction of cleavage sites for mitochondrial preseq

R-2 motif at 20 GRG : RD

NUCDISC: discrimination of nuclear localization signals

pat4: KKRK (5) at 190

FIG. 7B 
pat4: RPRK (4) at 1144
pat7: none
bipartite: RRAQEEQQRLLEAKRKE at 152
bipartite: KRIPINPASRQFRRIKG at 619
content of basic residues: 12:11
NLS Score: 1.08

ER Membrane Retention Signals:

XXRR-like motif in the N-terminus: AGGR

None
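
The two pat4 hits reported above (KKRK at 190 and RPRK at 1144) can be reproduced with a sliding four-residue window. This is a minimal sketch, assuming the usual description of the pat4 rule: either four basic residues (K or R), or three basic residues plus one H or P. That rule and the name `find_pat4` are assumptions for illustration, not taken from the figure, and the bracketed scores (5) and (4) are not modelled.

```python
BASIC = set("KR")


def find_pat4(seq: str, start: int = 1) -> list[tuple[int, str]]:
    """Return (position, window) for each 4-residue window matching the assumed pat4 rule.

    `start` is the residue number of seq[0], so positions match the figure numbering.
    """
    hits = []
    for i in range(len(seq) - 3):
        window = seq[i:i + 4]
        n_basic = sum(ch in BASIC for ch in window)
        n_hp = sum(ch in "HP" for ch in window)
        if n_basic == 4 or (n_basic == 3 and n_hp == 1):
            hits.append((start + i, window))
    return hits


# Residues 151-200 and 1141-1150 of 14790, copied from the sequence in FIG. 7D.
FRAGMENT_151 = "ERRAQEEQQRLLEAKRKEEQEQREILHEIQRRKEEIKEEKKRKEMAKQER"
FRAGMENT_1141 = "RVFRPRKLDR"
```

Calling `find_pat4(FRAGMENT_151, start=151)` and `find_pat4(FRAGMENT_1141, start=1141)` gives the positions in the numbering of 14790.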



Final Results (k = 9/23)

56.5 %: nuclear
30.4 %: cytoplasmic
4.3 %: vacuolar
4.3 %: mitochondrial
4.3 %: vesicles of secretory system

prediction for 14790 is nuc (k=23)
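
The percentages in the final results are consistent with a vote among 23 nearest neighbours. A short sketch of that arithmetic follows; the neighbour counts (13, 7, 1, 1, 1) are an assumption, back-calculated from the reported percentages rather than read from the figure, and `knn_summary` is an illustrative name.

```python
# Assumed neighbour counts per localization site, chosen to sum to k = 23.
NEIGHBOUR_COUNTS = {
    "nuclear": 13,
    "cytoplasmic": 7,
    "vacuolar": 1,
    "mitochondrial": 1,
    "vesicles of secretory system": 1,
}


def knn_summary(counts: dict[str, int]) -> tuple[int, dict[str, float], str]:
    """Return k, the percentage of neighbours per site, and the majority site."""
    k = sum(counts.values())
    percentages = {site: round(100.0 * n / k, 1) for site, n in counts.items()}
    prediction = max(counts, key=counts.get)
    return k, percentages, prediction
```

With these counts, 13 of 23 neighbours gives 56.5 %, 7 of 23 gives 30.4 %, and a single neighbour gives 4.3 %, matching the listed values.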



Start  End  Feature      Seq
144    204  coiled coil  FHEEMLERRA...AKQERLEIAS



Signal Peptide Predictions for 14790

Method               Predict  Score  Mat@
SignalP (eukaryote)  NO







Note: amino-terminal 70aa used for signal peptide 
prediction 

Transmembrane Segments Predicted by MEMSAT 



Start  End   Orient    Score
1522   1538  ins-->out  0.8



FIG. 7C 
>14790




MAGGRGAPGRGRDEPPESYPQRQDHELQALEAIYGADFQDLRPDACGPVK 
EPPEINLVLYPQGLTGEEVYVKVDLRVKCPPTYPDVVPEIELKNAKGLSN 
ESVNLLKSRLEELAKKHCGEVMIFELAYHVQSFLSEHNKPPPKSFHEEML 
ERRAQEEQQRLLEAKRKEEQEQREILHEIQRRKEEIKEEKKRKEMAKQER 
LEIASLSNQDHTSKKDPGGHRTAAILHGGSPDFVGNGKHRANSSGRSRRE 
RQYSVCNSEDSPGSCEILYFNMGSPDQLMVHKGKCIGSDEQLGKLVYNAL 
ETATGGFVLLYEWVLQWQKKMGPFLTSQEKEKIDKCKKQIQGTETEFNSL 
VKLSHPNVVRYLAMNLKEQDDSIVVDILVEHISGVSLAAHLSHSGPIPVH 
QLRRYTAQLLSGLDYLHSNSVVHKVLSASNVLVDAEGTVKITDYSISKRL 
ADICKEDVFEQTRVRFSDNALPYKTGKKGDVWRLGLLLLSLSQGQECGEY 
PVTIPSDLPADFQDFLKKCVCLDDKERWSPQQLLKHSFINPQPKMPLVEQ 
SPEDSGGQDYVETVIPSNRLPSAAFFSETQRQFSRYFIEFEELQLLGKGA 
FGAVIKVQNKLDGCCYAVKRIPINPASRQFRRIKGEVTLLSRLHHENIVR 
YYNAWIERHERPAGPGTPPPDSGPLAKDDRAARGQPASDTDGLDSVEAAA 
PPPILSSSVEWSTSGERSASARFPATGPGSSDDEDDDEDEHGGVFSQSFL 
PASDSESDIIFDNEDENSKSQNQDEDCNEKNGCHESEPSVTTEAVHYLYI
QMEYCEKSTLRDTIDQGLYRDTVRLWRLFREILDGLAYIHEKGMIHRDLK 
PVNIFLDSDDHVKIGDFGLATDHLAFSADSKQDDQTGDLIKSDPSGHLTG 
MVGTALYVSPEVQGSTKSAYNQKVDLFSLGIIFFEMSYHPMVTASERIFV 
LNQLRDPTSPKFPEDFDDGEHAKQKSVISWLLNHDPAKRPTATELLKSEL 
LPPPQMEESELHEVLHHTLTNVDGKAYRTMMAQIFSQRISPAIDYTYDSD 
ILKGNFSIRTAKMQQHVCETIIRIFKRHGAVQLCTPLLLPRNRQIYEHNE
AALFMDHSGMLVMLPFDLRIPFARYVARNNILNLKRYCIERVFRPRKLDR 
FHPKELLECAFDIVTSTTNSFLPTAEIIYTIYEIIQEFPALQERNYSIYL 
NHTMLLKAILLHCGIPEDKLSQVYIILYDAVTEKLTRREVEAKFCNLSLS
SNSLCRLYKFIEQKGDLQDLMPTINSLIKQKTGIAQLVKYGLKDLEEVVG 
LLKKLGIKLQVLINLGLVYKVQQHNGIIFQFVAFIKRRQRAVPEILAAGG 
RYDLLIPQFRGPQALGPVPTAIGVSIAIDKISAAVLNMEESVTISSCDLL 
VVSVGQMSMSRAINLTQKLWTAGITAEIMYDWSQSQEELQEYCRHHEITY 
VALVSDKEGSHVKVKSFEKERQTEKRVLETELVDHVLQKLRTKVTDERNG 
REASDNLAVQNLKGSFSNASGLFEIHGATVVPIVSVLAPEKLSASTRRRY 
ETQVQTRLQTSLANLHQKSSEIEILAVDLPKETILQFLSLEWDADEQAFN 
TTVKQLLSRLPKQRYLKLVCDEIYNIKVEKKVSVLFLYSYRDDYYRILF 



FIG. 7D 
Prosite Pattern Matches for 14790

Prosite version: Release 12.2 of February 1995

>PS00001 | PDOC00001 | ASN_GLYCOSYLATION
N-glycosylation site.



Query: 100 NESV 103
Query: 242 NSSG 245
Query: 1055 NFSI 1058
Query: 1195 NYSI 1198
Query: 1201 NHTM 1204
Query: 1246 NLSL 1249
Query: 1414 NLTQ 1417
Query: 1518 NASG 1521
Query: 1600 NTTV 1603
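
Each "Query" line above gives the start position, the matched residues, and the end position of one pattern hit. A minimal sketch of such a scan for the N-glycosylation pattern, written in PROSITE syntax as N-{P}-[ST]-{P}, using a regular expression with a lookahead so that overlapping hits are not skipped. The function name and the fragment constant are illustrative.

```python
import re

# N-{P}-[ST]-{P}: Asn, any residue but Pro, Ser or Thr, any residue but Pro.
# The lookahead makes matches zero-width so overlapping motifs are all reported.
N_GLYCOSYLATION = re.compile(r"(?=(N[^P][ST][^P]))")


def find_sites(seq: str, start: int = 1) -> list[tuple[int, str, int]]:
    """Return (first residue number, motif, last residue number) for each hit."""
    sites = []
    for match in N_GLYCOSYLATION.finditer(seq):
        motif = match.group(1)
        first = start + match.start()
        sites.append((first, motif, first + len(motif) - 1))
    return sites


# Residues 1192-1207 of 14790, copied from the sequence in FIG. 7D.
FRAGMENT_1192 = "QERNYSIYLNHTMLLK"
```

On this fragment the scan returns the two hits listed for positions 1195 and 1201. Other Prosite patterns can be scanned the same way by swapping the regular expression.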



>PS00004 | PDOC00004 | CAMP_PHOSPHO_SITE cAMP- and
cGMP-dependent protein kinase phosphorylation site.

Query: 403 RRYT 406
Query: 988 KRPT 991
Query: 1630 KKVS 1633

>PS00005 | PDOC00005 | PKC_PHOSPHO_SITE protein kinase 
C phosphorylation site. 



Query: 212 TSK 214
Query: 244 SGR 246
Query: 247 SRR 249
Query: 438 TVK 440
Query: 447 SKR 449
Query: 475 TGK 477
Query: 567 SNR 569
Query: 579 TQR 581
Query: 720 SAR 722
Query: 809 TLR 811
Query: 822 TVR 824
Query: 915 STK 917
Query: 945 SER 947
Query: 959 SPK 961



FIG. 7E 
Query: [illegible]
Query: 1060 TAK 1062
Query: 1232 TEK 1234
Query: 1236 TRR 1238
Query: 1416 TQK 1418
Query: 1455 SDK 1457
Query: 1473 TEK 1475
Query: 1545 STR 1547
Query: 1602 TVK 1604
Query: 1639 SYR 1641



>PS00006 | PDOC00006 | CK2_PHOSPHO_SITE Casein kinase
II phosphorylation site.

Query: 65 TGEE 68
Query: 82 TYPD 85
Query: 108 SRLE 111
Query: 144 SFHE 147
Query: 207 SNQD 210
Query: 213 SKKD 216
Query: 247 SRRE 250
Query: 326 TSQE 329
Query: 343 TETE 346
Query: 411 SGLD 414
Query: 551 SPED 554
Query: 688 SDTD 691
Query: 707 SSVE 710
Query: 713 TSGE 716
Query: 730 SSDD 733
Query: 753 SDSE 756
Query: 809 TLRD 812
Query: 880 SKQD 883
Query: 943 TASE 946
Query: 991 TATE 994
Query: 1020 TNVD 1023
Query: 1180 TIYE 1183
Query: 1236 TRRE 1239



FIG. 7F 
Query: 1395 SSCD 1398
Query: 1435 SQEE 1438
Query: 1455 SDKE 1458
Query: 1570 SEIE 1573
Query: 1639 SYRD 1642



>PS00007 | PDOC00007 | TYR_PHOSPHO_SITE Tyrosine
kinase phosphorylation site.

Query: 246 RSRRERQY 253
Query: 811 RDTIDQGLY 819
Query: 830 REILDGLAY 838
Query: 1444 RHHEITY 1450
Query: 1617 KLVCDEIY 1624



>PS00008 | PDOC00008 | MYRISTYL N-myristoylation site.

Query: 218 GGHRTA 223
Query: 384 GVSLAA 389
Query: 494 GQECGE 499
Query: 599 GAFGAV 604
Query: 613 GCCYAV 618
Query: 684 GQPASD 689
Query: 742 GGVFSQ 747
Query: 782 GCHESE 787
Query: 900 GMVGTA 905
Query: 914 GSTKSA 919
Query: 1373 GVSIAI 1378
Query: 1514 GSFSNA 1519



>PS00009 | PDOC00009 | AMIDATION Amidation site. 
Query: 475 TGKK 478 

>PS00107 | PDOC00100 | PROTEIN_KINASE_ATP Protein 
kinases ATP-binding region signature. 



FIG. 7G 



Query: 596 LGKGAFGAV 604 



>PS00108 | PDOC00100 | PROTEIN_KINASE_ST
Serine/Threonine protein kinases active-site
signature.

Query: 844 MIHRDLKPVNIFL 856

>PS00116 | PDOC00107 | DNA_POLYMERASE_B DNA polymerase
family B signature.

Query: 687 ASDTDGLDS 695



Protein Family / Domain Matches, HMMer version 2 

Searching for complete domains in PFAM 

Hmmpfam search a single seq against HMM database 

HMMER: 2.1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University 
School of Medicine 

HMMER is freely distributed under the GNU General
Public License (GPL).

HMM file: /prod/ddm/seqanal/PFAM/pfam4.4/Pfam

Sequence file: /prod/ddm/wspace/orfanal/oa-script.16895.seq



Query: 14790 

Scores for sequence family classification (score includes all domains):

Model            Description                       Score   E-value  N
pkinase          Eukaryotic protein kinase domain  282.0   7.8e-81  4
Ribosomal_L23    Ribosomal protein L23               5.0       3.9  1
mRNA_cap_enzyme  mRNA capping enzyme              -181.3       9.6  1



Parsed for domains:

Model            Domain  seq-f  seq-t     hmm-f  hmm-t      Score  E-value
pkinase          1/4       332    443 ..     30    134 ..    69.4  3.7e-18
pkinase          2/4       501    539 ..    237    278 .]    22.8  3.2e-05
pkinase          3/4       590    662         1     66 [     51.2  4.2e-13
pkinase          4/4       797   1001 ..     65    278 .]   138.7  1.1e-37
Ribosomal_L23    1/1      1223   1244         1     23 [      5.0      3.9
mRNA_cap_enzyme  1/1      1004   1309         1    504 [   -181.3      9.6
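
The per-domain rows can be cross-checked against the per-model summary. A small sketch, assuming only the numbers printed above: it computes the length of each matched sequence segment and sums the four pkinase domain scores, which comes to 282.1 and agrees with the reported model score of 282.0 to within rounding. That the totals line up is an observation on these figures, not a statement about how HMMER computes the summary score. The `Domain` type and helper names are illustrative.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    model: str
    seq_from: int
    seq_to: int
    score: float
    e_value: float


# Rows copied from the "Parsed for domains" table.
DOMAINS = [
    Domain("pkinase", 332, 443, 69.4, 3.7e-18),
    Domain("pkinase", 501, 539, 22.8, 3.2e-05),
    Domain("pkinase", 590, 662, 51.2, 4.2e-13),
    Domain("pkinase", 797, 1001, 138.7, 1.1e-37),
    Domain("Ribosomal_L23", 1223, 1244, 5.0, 3.9),
    Domain("mRNA_cap_enzyme", 1004, 1309, -181.3, 9.6),
]


def span_length(domain: Domain) -> int:
    """Number of residues of 14790 covered by one domain hit (inclusive bounds)."""
    return domain.seq_to - domain.seq_from + 1


def total_score(domains: list[Domain], model: str) -> float:
    """Sum of the per-domain scores for one model, rounded to one decimal."""
    return round(sum(d.score for d in domains if d.model == model), 1)
```

For example, the fourth pkinase hit spans 205 residues (797 to 1001), the longest of the four segments.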

Alignments of top-scoring domains: 
pkinase: domain 1 of 4, from 332 to 443: score
69.4, E = 3.7e-18



*->ilk..kesls..IrEiqilkrIsHpNIvrllg 
++++ k++++++ +E l +lsHpN+vr+l 
14790 332 KIDkcKKQIQgtETEFNSLVKLSHPNVVRYLA 363 

vfed...tddhlylvmEymegGdLfdylrrngpls 
+ +++++ + + E+ g +L+ +l+ gp++ 
14790 364 MNLKeqdDISVVDILVEHISGVSLAAHLSHSGPIP 398 

ekeakkialQilrGleYLHsngivHRDLKpeNlLl 
+++++++ Q+l+Gl+YLHsn++vH L ++N+L+ 



FIG. 7I 






14790 399 VHQLRRYTAQLLSGLDYLHSNSVVHKVLSASNVLV 433 

dengtvKiaD<-* SEQ. ID NO: 8 
d +gtvKi+D 
14790 434 DAEGTVKITD 443 



pkinase: domain 2 of 4, from 501 to 539: score 
22.8, E = 3.2e-5 

*->rlplpsncSeelkdLlkkcLnkDPskRpGsat 
+ +ps ++ +++d+lkkc ++D ++R+ + 
14790 501 PVTIPSDLPADFQDFLKKCVCLDDKERW S 529 



akeilnhpwf<-* SEQ. ID NO: 9 
+++l+h ++ 
14790 530 PQQLLKHSFI 539 



pkinase: domain 3 of 4, from 590 to 662: score 
51.2, E = 4.2e-13 

*->yelleklGeGsfGkVykakhk.tgkivAvKil 
+e l+ lG+G+fG V k+++k +g+ +AvK + 
14790 590 FEELQLLGKGAFGAVIKVQNKlDGCCYAVKRI 621 

kkesls......IrEiqilkrIsHpNIvrllgvfe 
+ s++ ++ E+ +l rl+H+NIvr++ ++ 
14790 622 PINPASrqfrriKGEVTLLSRLHHENIVRYYNAWI 656 



FIG. 7J 
dtddhl<-* SEQ. ID NO: 10 
++++++ 

14790 657 ERHERP 662 

pkinase: domain 4 of 4, from 797 to 1001: score 
138.7, E = 1.1e-37 

*<-hlylvmEymegGdLf dylrrngplsekeakki 
+ly+ mEy+e+ L+d + + + + + ++ 
14790 797 YL YI QMEYCEKSTLRDT I DQGLYRDTVRLWRL 828 

alQilrGleYLHsngivHRDLKpeNILldengtvK 
+++il Gl+Y+H++g +HRDLKp NI+ld++ +vK 
14790 829 FREILDGLAYIHEKGMIHRDLKPVNIFLDSDDHVK 863 



iaDFGLArll ekl 

i+DFGLA++ + ++++++++++ +++++++1 
14790 864 IGDFGLATDHlaf sadskqddqtgdliksdpsGHL 898 

ttfvGTpwYmmAPEvileg. . . rgysskvDvWSlG 
t+ vGT Y+ +PEv +g++++ Y kvD S1G 
14790 899 TGMVGTALYV-SPEV — QGstkSAYNQKVDLFSLG 930 

viLyElltggplfpgadlpaftggdevdqliifvl 
+i++E+ 

14790 931 IIFFEMS 937 



klPf sdelpktridpleelf rikkr . . . . rlplps 
P e++f +++ ++++ +++p+ 
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14790 938 YHPMV TASERIFVLNQLrdptSPKFPE 964 

ncSee. . . . lkdLlkkcLnkDPskRpGsatakeil 
+ + ++ + k++++++Ln DP+kRp ta+e+1 
14790 965 DFDDGehakQKSVI SWLLNHDPAKRP TATELL 996 






nhpwf<-* SEQ. ID NO: 11 
++ + 

14790 997 KSELL 1001 

Ribosomal_L23 : domain 1 of 1, from 1223 to 1244: 
score 5.0, E = 3.9 

*->tdiikyPviTeKlamnlleepNk<-* 

SEQ. ID NO: 12 
++ii y +TeKl+++++e ++ 
14790 1223 VYIILYDAVTEKLTRREVEA-KF 1244 

mRNA_cap_enzyme: domain 1 of 1, from 1004 to 1309: 
score -181.3, E = 9.6 

*->nqtteRvyelhkiElf svpelnGKKiglgi 
q++e + e+ 1 
14790 1004 PQMEES ELHEV LHH 1017 

kLpktdteslrtmVakllglamktktfPddeGs 
L+++d++ rtm a+ + P + 

14790 1018 TLTNVDGKAYRTMMAQIFS QRISPAIDYT 104 6 

qPVsf erkdleesLkekdyfvceKTDGircshg 
+ + + ++ vce ir+ 
14790 1047 YD S D I LKGNFS I RT AKMQQHVCET I — IRI 1074 

FNRTGFLlAaLlFlvehpgleeaiSHiLSgef . 

+ +h + +++ 
14790 1075 FKRHGAVQL CTp 108 6 

.lidReknyYKQDYIDllpkrlfPrekdktkak 
1+ R +Y ++ ++ If 
14790 1087 1LLPRNRQIY EHNEAALFMD 1106 
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elptyhrgtllDGElvidinriaveqkTlrYvv 
+ + 1++ Id ++ rYv 

14790 1107 HSGMLVM-LPFD LRIPF-ARYVA 1127 

FDalaisGqtviqrd.lskrLgdefikavkKpf 
++ ++ + i+r + L+ f 
14790 1128 RNN I LNLKRYC I ERV f RPRKLDR- F HP 1153 

def kkvmpdakilnqqkYNFpf kiglkhmslsy 
+e+ d+++++ f++ + ++y 

14790 1154 KELLECAFDIVTSTT NSFLPTAEIIYTIY 1182 

gqlkllkaeskmviskadampkllHinDGUft 
++ + a + 
14790 1183 EIIQEFPA L 1191 

cvrdtpyieGeiLVEPGNSYlDfnLlKWKPkee 
++++i+ + L1K 
14790 1192 QE-RNYSIYL NHTMLLKA 1208 

nTvDFelilefeevndPeldekdgf slyLdYda 
+1 + +e++ 
14790 1209 ILLHCG IPEDK 1219 

mpGELf kf slgVWqgGf nkrFevihtdqif f rv 
+ ++++ + ++++++ v 
14790 1220 LSQVYI ILY-DAVTEKLTRRE V 1240 

af qklgRlkhefaelsVsdkdwyklkaleqpld 
++ f +ls+s +1 
14790 1241 EAK FCNLSLSSNSLCRLY 1258 

GrIVEcrladieillFQegrWeylrfRdDKqqa 
+++E ++ + + ++ + q+ 

14790 1259 -KFI EQKGDLQD LMPTINSLIKQK 1281 

lKtgGYsgNhistvekvllsikDgvsiEeLlkl 
tg +++v kD +Ee+ 1 
14790 1282 --TG IAQLVKYGLKD LEEVVGL 1301 

fpGmyFAGAktlikr<-* 
k +ik 
14790 1302 LK KLGIKL 1309 SEQ. ID NO: 13 



// 

Searching for complete domains in SMART 

hmmpfam - search a single seq against HMM database 

HMMER 2.1.1 (Dec 1998) 

Copyright (c) 1992-1998 Washington University 
School of Medicine 

HMMER is freely distributed under the GNU General 
Public License (GPL) . 



HMM file: /ddm/robison/smart/smart/smart.all.hmms 
Sequence file: /prod/ddm/wspace/orfanal/oa-script.16895.seq 



Query: 14790 



Scores for sequence family classification (score includes all domains): 
Model     Description   Score    E-value  N 
serkin_6                184.2    2.2e-51  2 
tyrkin_6                -40.3      2e-09  1 

Parsed for domains: 
Model     Domain  seq-f  seq-t  hmm-f  hmm-t      score  E-value 
serkin_6    1/2     286    539      1    231 []    17.2  2.2e-11 
serkin_6    2/2     590   1001      1    231 []   166.8  3.5e-46 
tyrkin_6    1/1     590   1001      1    280 []   -40.3    2e-09 
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Alignments of top-scoring domains: 

serkin_6: domain 1 of 2, from 286 to 539: score 
17.2, E = 2.2e-11 

*->YellkklGkGaFGkVylardkktgrlvAiKvi 
++lGk Vy a ++ tg v + 
14790 286 IGSDEQLGK LVYNALETATGGFV LL 310 

k erilr 

+ + +++ ++ +++++++ ++ +++ + ++ 
14790 311 YewvlqwqkkmgpfltsqekekidkckkqiQGTET 345 



EikiLkk.dHPNIVkLydvfed dklylVmE 
E + L k HPN+V+++ + +++++ + + E 
14790 346 EFNSLVKlSHPNVVRYLAMNLKeqddsIVVDILVE 380 

yceGdlGdLfdllkkrgrrglrkvlsE.earfyfr 
+++G +L +l + g+ ++ ++ r+y++ 
14790 381 HISG--VSLAAHLSHSGP IPVhQLRRYTA 407 

QilsaLeYLHsqgliHRDLKPeNiLLds..hvKla 
Q+ls+L+YLHs+ ++H L +N+L+d +++VK++ 
14790 408 QLLSGLDYLHSNSVVHKVLSASNVLVDAegTVKIT 442 

DFGlArql ttfvGTpeYmAPEvl . . 
D ++++l + +++ + t v +++ 
14790 443 DYSISKRLadickedvfeQTRV RFsd 468 

gYgkpavDiWSlGcilyElltGkpPFpqld 
+ + + gk + D+W lG +l l G+ 
14790 469 nalpyKTGK-KGDVWRLGLLLLSLSQGQE C 497 

lif kkig SpeakdLikklLvkdPekRlta . e 
+ + +++ + ++ d++kk+ + d ++R++ ++ 
14790 498 GEYPVTIpsdlPADFQDFLKKCVCLDDKERWSPqQ 532 

aLedeldikaHPFF<-* SEQ. ID NO: 14 
+L+ H f+ 
14790 533 LLK HSFI 539 



FIG. 7O 
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serkin_6: domain 2 of 2, from 590 to 1001: score 
166.8, E = 3.5e-46 

*->YellkklGkGaFGkVylardkktgrlvAiKvi 
+e l++lGkGaFG V ++++k +g +A+K+i 
14790 590 FEELQLLGKGAFGAVIKVQNKLDGCCYAVKRI 621 

k 

+ ++ +++ ++ +++ + ++ ++++ + + 
14790 622 Pinpasrqf rrikgevtllsrlhhenivryynawi 656 



++++++ +++++++++++ ++++ ++++ ++++ 
14790 657 erherpagpgtpppdsgplakddraargqpasdtd 691 



+ ++ + +++ +++ + +++++++ + + + 
14790 692 gldsveaaapppilsssvewstsgersasarfpat 726 



+ ++ + +++ +++ + +++++++ + + + 
14790 727 gpgssddedddedehggvf sqsf lpasdsesdiiF 761 

rilrEikiLkk. . . dHPNIVkLydvf ed d 

+ + E + +++++d+ ++++ + +++ + 

14790 762 DNEDENSKSQNqdeDCNEKNGCHESEPSvtteavH 796 

klylVmEyceGdlGdLfdllkkrgrrglrkvlsE. 
+ly+ mEyce +L+d +++ + + 
14790 797 HLYIQMEYCEK — STLRDT I DQGL Y RDTv 823 

earfyfrQilsaLeYLHsqgliHRDLKPeNiLLds 
+ +++fr+il++L+Y+H++g iHRDLKP Ni+Lds 
14790 824 RLWRL FRE I LDGL AYI HEKGMI HRDLKPVN I FLDS 858 

. . hvKlaDFGlArql 

++hvK++DFGlA+ + ++++++++++ +++ 

14790 859 ddHVKIGDFGLATDHlafsadskqddqtgdliksd 893 

ttf vGTpeYmAPEvl gYgkpavDiW 



FIG. 7P 
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++++ t +vGT Y++PEV +++++gY+ +vD 
14790 894 psghlTGMVGTALYVSPEVQgstksAYNQ-KVDLF 927 

SIGcilyElltGkpPFp. .qldlifkkig 

SlG+i++E+ + p ++ ++++++++++++ 
14790 928 SLGI I FFEMSY-HPMVTasERI FVLNQLRd.pt spk 961 

SpeakdLikklLvkdPekRlta . eaL 

+++ ++++ k+ i+ lL+ dP+kR+ta+e+L 
14790 962 fpedfddgeHAKQKSVISWLLNHDPAKRPTAtELL 996 

edeldikaHPff<-* SEQ. ID NO: 15 
+ + + 

14790 997 K SELL 1001 

tyrkin_6: domain 1 of 1, from 590 to 1001: score 
-40.3, E = 2e-09 

*->ltlgkklGeGaGFeVykGtlk. . . ieVAVKtL 
+ ++ LG GaFG V k + k ++ AVK + 
14790 590 FEELQLLGKGAFGAVIKVQNKldgCCYAVKRI 621 

keda. . . . keeFlrEakiMkklGgkHpNiVkLlGv 
+ +++ + + E ++++1 +H+NiV+ + + 
14790 622 PINPasrqFRRIKGEVTLLSRL--HHENIVRYYNA 654 

cteegrrFmevePlmivmEymegGdLldyLrknrp 
+e 

14790 655 WIE RHERP 662 

k 

+++++++++++ ++++ ++++ +++++ ++ + 
14790 663 Agpgtpppdsgplakddraargqpasdtdgldsve 697 



+++ ++++ + + +++++++ + + + +++++++ 
14790 698 aaapppilsssvewstsgersasarfpatgpgssd 732 

+++++++++++ +++ + ++++++ ++++++ 



FIG. 7Q 
14790 733 dedddedehggvfsqsflpasdsesdiifdneden 767 



+++++++++ +++++ ++++++ +++ + + 
14790 768 sksqnqdedcnekngchesepsvtteavhylyiqm 802 

lslsdLlsf AlQIAkGMe 

+ ++++ +++ +++ + L I +G + 

14790 803 eycekstlrdtidqglyRDTVRLWRLFREILDGLA 837 

YLesknf vHRDLAARNcLvgenkvvKIsDFGLsRd 
Y+++k+ +HRDL N+ ++++ +VKI+DFGL+ d 
14790 838 YIHEKGMIHRDLKPVNIFLDSDDHVKIGDFGLATD 872 

lyddDkkG eskdyYrkkggkggktllPir . 

+ ++ +++++ +d +++ +++ +++ 
14790 873 HLAF-SADskqddQTGDLIKSDPSGHLTGMVGTAl 906 

WmAPEsl . . kdgkFtskSDVWSFGVlLWEif tlGe 
+PE+ ++ ++ ++ k D +S G+ +E+ 
14790 907 YVSPEVQgsTKSAYNQKVDLFSLGIIFFEM 936 

qPYpgeiqqfmsnee . . . vleylkkGyRlpkPend 
Y + ++++e+ vl++l++ ++ Pe+ 
14790 937 -SYHPM VTASErifVLNQLRDPTSPKFPED- 965 

lpiSs . vtCPdelYdlMlqCWaedPedRPtFsel . 
++ + +++ +++++ ++dP++RPt +el 
14790 966 FdDGEHAKQKSVI SWLLNHDPAKRPT ATELl 996 

.verl<-* SEQ. ID NO: 16 
+ e+l 

14790 997 kSELL 1001 

ProdomId  Start   End  Score  Description 
40346        18   323    295  p99.2 (2) 001712(1) // INITIATION FACTOR KINASE EUKARYOTIC EIF-2 ALPHA EIF-2ALPHA 
137719       19  1504     87  p99.2 (1) 074297_NEUCR // CPC3 PROTEIN 
1           341  1017    107  p99.2 (2773) CC2(14) KKIT(14) KPC1(13) // KINASE PROTEIN TRANSFERASE ATP-BINDING SERINE/THREONINE-PROTEIN PHOSPHORYLATION RECEPTOR TYROSINE-PROTEIN PRECURSOR TRANSMEMBRANE 
150228     1105  1206     90  p99.2 (1) 074297_NEUCR // CPC3 PROTEIN 
2305       1106  1230    137  p99.2 (23) SYH(12) // SYNTHETASE AMINOACYL-TRNA HISTIDYL-TRNA PROTEIN LIGASE ATP-BINDING BIOSYNTHESIS HISTIDINE--TRNA HISRS KINASE 
42726      1274  1648    221  p99.2 (2) 001712(1) 061651(1) // INITIATION FACTOR KINASE EUKARYOTIC EIF-2 ALPHA EIF-2ALPHA 
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View Prodom 40346 

>40346 p99.2 (2) 001712(1) 061651(1) //INITIATION 
FACTOR KINASE EUKARYOTIC EIF-2 ALPHA EIF-2ALPHA 
Length = 296 

Score = 295(108.9 bits), Expect = 1.7e-28, Sum 
P(2) = 1.7e-28 Identities = 58/144 (40%), 
Positives = 95/144 (65%) 
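
The identity and positive percentages in these listings can be recomputed from the printed fractions. In the numbers shown here the percentages look truncated rather than rounded (95/144 is about 65.97% but is printed as 65%); that is an observation from this listing, not a documented rule. A minimal sketch, with `check_percentages` as a hypothetical helper name:

```python
import re

# Statistics line for the first alignment against ProDom 40346.
HSP_STATS = "Identities = 58/144 (40%), Positives = 95/144 (65%)"

FRACTION_RE = re.compile(
    r"(Identities|Positives)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)"
)


def check_percentages(stats):
    """Map each label to (recomputed %, printed %, whether they agree).

    The recomputed value uses integer division, i.e. truncation, which is
    an assumption based on the figures in this listing.
    """
    results = {}
    for label, num, den, pct in FRACTION_RE.findall(stats):
        computed = (100 * int(num)) // int(den)
        results[label] = (computed, int(pct), computed == int(pct))
    return results


print(check_percentages(HSP_STATS))
```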

Query: 18 SYPQRQDHELQALEAIYGADFQDLRPDACGPVKEP 52 

S+ +RQ EL+ +++I+G D +DLRP A + +P 
Sbjct: 9 SFRERQAQELEVIKSIFGCDVEDLRPQANPSLWKP 43 

Query: 53 PEINLVLYP--QGLTGEEVYVKVDLRVKCPPTYPD 85 
+I + L P G E YV L V CP YP 
Sbjct: 44 TDIRIQLTPLRDSSNGLETYVCTKLHVTCPSKYPK 78 

Query: 86 VVPEIELKNAKGLSNESVNLLKSRLEELAKKHCGE 120 
+ P+I L+ +KG+S++ + L+++L+ +++ GE 
Sbjct: 79 LPPKISLEESKGMSDQLLEALRNQLQAQSQELRGE 113 

Query: 121 VMIFELAYHVQSFLSEHNKPPPKSFHEEMLERRAQ 155 
VMI+ELA VQ+FL EHNKPP SF+++ML+ + + 
Sbjct: 114 VMIYELAQTVQAFLLEHNKPPKGSFYDQMLQDKQK 148 

Query: 156 EEQQ 159 
+Q+ 
Sbjct: 149 RDQE 152 SEQ. ID NO: 17 

Score = 63 (27.2 bits), Expect = 1.7e-28, Sum P(2) 
= 1.7e-28 Identities = 15/59 (25%), Positives = 
30/59 (50%) 

Query: 266 EILYFN-MGSPDQLMVHKGKCIGSDEQLGKLVYNA 299 
E LYF+ MG + +G C+G ++ G + Y 
Sbjct: 230 ETLYFHKMGR QIQRGCCVGHSQR-GCIAYTG 259 

Query: 300 LETATGGFVLLYEWVLQWQKKMGP 323 
++ G + + EW +++ + P 
Sbjct: 260 IDMHCGQLLYITEW?IKYSQLEQP 283 SEQ. ID NO: 18 

View Prodom 42726 

>42726 p99.2 (2) 001712(1) 061651(1) //INITIATION 
FACTOR KINASE EUKARYOTIC EIF-2 ALPHA EIF-2ALPHA 
Length = 469 

Score = 221 (82.9 bits), Expect = 7.5e-15, P = 
7.5e-15 Identities = 92/341 (26%), Positives = 
160/341 (45%) 

fig 7i 

Query: 1274 INSLIKQKTGIAQLVKYGLKDLEEXXXXXXXXX 1306 

+ SL++ K A L + L++LE 
Sbjct: 70 LKSLMRGKGEAASLARGALRELETVVGLAYSLG 102 

Query: 1307 XXXXXXXNLGLVYKVQQ- -HNGI I FQFVAFIKR 1337 

GL + + GI++Q A +K 

Sbjct: 103 VKCPIHIWAGLPISFDRASNGGIVWQMTADLKP 135 

FIG. 7U 
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Query: 1338 RQRAVPEILAAGGRYDLLIPQF-RGPQALGPVP 1369 

+ P +LA G RYD ++ +F + Q P 
Sbjct: 136 NRSGHPSVLAIGERYDSMLHEFQKQAQKFNPAM 168 

Query: 1370 TAIGV SIAIDKISAAVLNMEESVTISS 1396 

A GV + ++DK+ AAV +E + + 

Sbjct: 169 PARGVLSGAGLTFSLDKLVAAV-GVEYAKDCRA 200 

Query: 1397 CDLLVVSVGQMSMSRAINLTQKL-WTAGITAEI 1428 

D+ + G + + +L W+ GI I 

Sbjct: 201 IDVGICVCGTRPPLKDVTYIMRLLWSVGIRCGI 233 

Query: 1429 MYDWSQSQEELQEYCRHHEITYVALVSDKEGSH 1461 

+ S+ +E Q+ R + +V LV++ GS 
Sbjct: 234 VEAASELGDEAQDLARLGAL-HVILVAEN-GS- 263 

Query: 1462 VKVKSFEKERQTEKRVLETELVDHVLQKLRTKV 1494 

++V+SFE+ER E+ + TELV+ + + LR+ 
Sbjct: 264 LRVRSFERERFQERHLTRTELVEFIQKMLRS-- 294 

Query: 1495 T DERNGRE AS DNL AVQNLKGS FSNAS 1520 

D NG DN + + GS N S 
Sbjct: 295 -DGLNGTTV-DNFSHLSALGSGDNRSSGGKERE 325 

Query: 1521 GLF-EIGHATV VPIVSV — LAPE 1540 

GL AT+ +P + V L + 

Sbjct: 326 RGENGLSTSASNATIKNNYSQLPNLQVTFLTHD 358 

Query: 1541 KLSASTRRRYETQVQTRLQTSLANLHQKSSEIE 1573 

K +A+ +RR E QV ++ ++L+ +K + + 
Sbjct: 359 KPTANYKRRLENQVAQQMSSTLSQFLKKETFV- 390 

Query: 1574 ILAVDLPKETI 1584 
+L V+LP + 

Sbjct: 391 VLVVELPPAVV 401 SEQ. ID NO: 19 

Score = 150 (57.9 bits), Expect = 4.2e-07, P = 
4.2e-07 Identities = 66/300 (22%), Positives = 
137/300 (45%) 

Query: 1368 VPTAIGVSIAIDKISAAVLNMEESVTISSCDLL 1400 
V + G++ ++DK+ AAV +E + + D+ 
Sbjct: 173 VLSGAGLTFSLDKLVAAV-GVEYAKDCRAIDVG 204 

Query: 1401 VVSVGQMSMSRAINLTQKL-WTAGITAEIMYDW 1432 
+ G + + +L W+ GI I+ 
Sbjct: 205 ICVCGTRPPLKDVTYIMRLLWSVGIRCGIVEAA 237 

Query: 1433 SQSQEELQEYCRHHEITYVALVSDKEGSHVKVK 1465 
S+ +E Q+ R + +V LV++ V+ 
Sbjct: 238 SELGDEAQDLARLGAL-HVILVAENGSLRVRSF 269 

Query: 1466 SFEK--ERQTEKRVLETELVDHVLQK--LRTKV 1494 
E+ ER + L E + +L+ L 
Sbjct: 270 ERERFQERHLTRTEL-VEFIQKMLRSDGLNGTT 301 



Query: 1495 TDERNGREA SDNLAV QNLKGSFSN 1518 

D + A DN + + G ++ 

Sbjct: 302 VDNFSHLSALGSGDNRSSGGKERERGENGLSTS 334 

Query: 1519 ASGLFEIHGATVVPIVSV — LAPEKLSASTRRR 1549 

AS + + +P + V L +K +A+ +RR 

Sbjct: 335 ASNATIKNNYSQLPNLQVTFLTHDKPTANYKRR 367 

Query: 1550 YETQVQTRLQTSLANLHQKSSEIEILAVDLPKE 1582 

E QV ++ ++L+ +K + + +L V+LP 
Sbjct: 368 LENQVAQQMSSTLSQFLKKETFV-VLVVELPPA 399 

Query: 1583 TILQFLSL--EWDADEQAFNTTVKQLLSRLPK- 1612 

+ + + ++ + ++ R K 

Sbjct: 400 VVNAIVGAINPREIRKRETEPEINYVIERFSKY 432 

Query: 1613 QRYLKLVCDEIYNIKVEKKVSVLFLYSYRDDYY 1645 

+RY+ + +E+ + + K ++ LYS D YY 
Sbjct: 433 KRYISEINEEVVDYLSDAKTPIVALYSISDSYY 4 65 

Query: 1646 RIL 1648 
R++ 

Sbjct: 466 RVI 468 SEQ. ID NO: 20 
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View Prodom 2305 

>2305 p99.2 (23) SYH(12) // SYNTHETASE AMINOACYL- 
TRNA HISTIDYL-TRNA PROTEIN LIGASE ATP-BINDING 
BIOSYNTHESIS HISTIDINE--TRNA HISRS KINASE Length = 

Score = 137 (53.3 bits), Expect = 3.9e-08, P = 
3.9e-08 Identities = 41/131 (31%), Positives = 
62/131 (47%) 

Query: 1106 
D G L+ L +DL +PFARYVA N + L L 
Sbjct: 18 50 

Query: 1135 
KRY I +V+R + + R +E +C FDI+ 
Sbjct: 51 82 

Query: 1166 
++ P + + N+ I 
Sbjct: 83 

Query: 1199 
+NH +L ++L P K Q Y+ Y A 
Sbjct: 116 CINHRGILDSLLQ PWPKTLQEYLTQYKA 143 

SEQ. ID NO: 21 

View Prodom 1 

>1 p99.2 (2773) CC2(14) KKIT(14) KPC1(13) //KINASE 
PROTEIN TRANSFERASE ATP-BINDING SERINE/THREONINE- 
PROTEIN PHOSPHORYLATION RECEPTOR TYROSINE-PROTEIN 
PRECURSOR TRANSMEMBRANE Length = 431 



Score = 107 (42.7 bits), Expect = 0.0032, Sum P(2) 
= 0.0032 Identities = 36/106 (33%), Positives = 
50/106 (47%) 
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Query: 848 DLKPVNI FLDSDDH VK- I GDFGLATDHLA 875 

DLKP NI LD + H +K I DFGLA + 

Sbjct: 220 DLKPENILLDEESHENTPNMIKLIADFGLAKE — I 253 

Query: 876 FSADSKQDDQTGDLIKSDPSGHLTGMVGTALYVS- 909 

+S+ S ++ + + M GT YVS 

Sbjct: 254 YSSSSTYEEMSSSQAVFGSHQTTSTMCGTPYYVSM 288 

Query: 910 PEVQGSTKSA YNQKVDLFSLGI I FFE 935 

PE SA Y+ K D++S G+I +E 

Sbjct: 289 KSMAPEYMAPESSATNYQKYSTKSDVWSFGVILYE 323 

Query: 936 M 936 
M 

Sbjct: 323 M 323 SEQ.ID NO: 22 

Score = 105 (42.0 bits), Expect = 3.5e-05, Sum 
P(3) = 3.5e-05 Identities = 35/102 (34%), Positives 
= 48/102 (47%) 

Query: 824 RLWRL FRE I L DGL AY I HEK GMIHR DL 849 

+L +1 GL Y+H K G+IHR DL 

Sbjct: 187 QLMHYVHQIAKGLEYLHSKNQKHQGIIHRAKKVDL 221 

Query: 850 KPVNI FLDSDDH VK- IGDFGLATDHLAFS 877 

KP NI LD + H +K I DFGLA + +S 

Sbjct: 222 KPENILLDEESHENTPNMIKLIADFGLAKE — IYS 254 

Query: 878 ADSKQDDQTGDLIKSDPSGHLTGMVGTALYVS 909 

+ S ++ + + M GT YVS 

Sbjct: 255 SSSTYEEMSSSQAVFGSHQTTSTMCGTPYYVS 286 

SEQ.ID NO: 23 

Score = 74 (31.1 bits), Expect = 0.91, Sum P(2) = 
0.60 Identities = 30/127 (23%), Positives = 60/127 
(47%) 

Query: 341 QGTETEFNSLVKLSHPNVVRYLAMNLKEQDDSIVV 375 

+G+ E+ + + ++ L++ + +++ 

Sbjct: 134 EGSLVEYMEYMSGGSEDYMKKLSLETVMKIAMMIL 168 

Query: 376 DIL-VEHISGVSLAAHLSHSGPIPVHQLRRYTAQL 409 

+ + H+S S + LSHS QL Y Q+ 

Sbjct: 169 QFMQIMHMSSESES — LSHS QLMHYVHQI 195 

Query: 410 LSGLDYLHSNS VVHKV LSASNVLVDA 435 

GL+YLHS + ++H+ L N+L+D 

Sbjct: 196 AKGLEYLHSKNQKHQGIIHRAKKVDLKPENILLDE 230 

Query: 436 EG TVK-ITDYSISKRL 450 

E +K I D+ ++K + 

Sbjct: 231 ESHENTPNMIKLIADFGLAKEI 252 

SEQ.ID NO: 24 

Score = 65 (27.9 bits), Expect = 3.5e-05, Sum P(3) 
= 3.5e-05 Identities = 29/124 (23%), Positives = 
50/124 (40%) 

Query: 907 YVSPEVQGSTKSAYNQKVDLFSLGIIFFEM-SYHP 940 

Y++PE + Y+ K D++S G+I +EM + P 
Sbjct: 294 YMAPESSATNYQKYSTKSDVWSFGVILYEMLTGKP 328 

Query: 941 MVTASERIFVLNQLRDPTSPKFPEDFDDGEHAKQK 975 

E +++ S K E + G + 

Sbjct: 329 PFFPGES--EVSEEEPYQSMKNMEVLEMGPEETIQ 361 

Query: 976 S VI SWLLNHDPAKRPT ATELLKS 998 

V+S ++ + P A +LLK 

Sbjct: 362 KVMSKIVEKKGERMPQPSSSNCPEVSQEAKDLLKK 396 

Query: 999 ELLPPPQMEESELHEVLHH 1017 

L P+ E+L H 

Sbjct: 397 CLQKDPEKRRPTFEEILQH 415 SEQ.ID NO: 25 

Score = 55 (24.4 bits), Expect = 3.5e-05, Sum P(3) = 
3.5e-05 Identities = 12/23 (52%), Positives = 18/23 
(78%) 

Query: 589 EFEELQ-LLGKGAFGAVIKVQNK 610 

++E L+ LLGKG+FG V K ++K 
Sbjct: 33 QYELLKKLLGKGSFGKVYKAKHK 55 

SEQ.ID NO: 26 
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Score = 49 (22.3 bits), Expect 34., Sum P(2) = 1.0 
Identities = 12/39 (30%), Positives = 24/39 (61%) 

Query: 507 DLPADFQDFLKKCVCLD-DKERWSPQQLLKHS-FI 539 

++ + +D LKKC+ D +K R + +++L+H F+ 
Sbjct: 385 EVSQEAKDLLKKCLQKDPEKRRPTFEEILQHPWFL 419 

Query: 540 NPQP 543 

P 

Sbjct: 420 MRNP 423 SEQ.ID NO: 27 

Score = 40 (19.1 bits), Expect = 0.0010, Sum P(3) 
= 0.0010 Identities = 8/11 (72%), Positives = 9/11 
(81%) 

Query: 596 LGKGAFGAVIK 606 

LG G+FGAV K 
Sbjct: 2 LGTGSFGAVYK 12 SEQ.ID NO: 28 

View Prodom 150228 

>150228 p99.2 (1) 074297_NEUCR //CPC3 PROTEIN 
Length = 108 



Score = 90 (36.7 bits), Expect = 0.0039, P = 
0.0039 Identities = 32/105 (30%), Positives = 
43/105 (40%) 

Query: 1105 MDHSGMLVMLPFDLRIPFARYVAR--NNILN 1133 

+D +G ++ LPFDL + AR +AR N+ + 
Sbjct: 3 LDQNGTVLQLPFDLMMGHARSLARITNSPVV 33 

Query: 1134 LKRYCIERVFRPRKLDRFHPKELLECAFDIVT 1165 

K Y +FR R P E FDIVT 

Sbjct: 35 QKSYSFGNIFRDRH-GGGQPDVYGEVDFDIVP 65 

Query: 1166 STTNSF-LPXXXXXXXXXXXXQEFPALQERNY 1196 
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S L FP + 

Sbjct: 66 SDALDLALKEAEVIKVLDEIATAFPTVSSTPI 97 

Query: 1197 SIYLNHTMLL 1206 

L H+ LL 

Sbjct: 97 CFQLGHSDLL 106 SEQ.ID NO: 29 
View Prodom 137719 

>137719 p99.2 (1) 074297_NEUCR // CPC3 PROTEIN Length 
= 304 



Score = 87 (35.7 bits), Expect = 0.61, Sum P(2) = 
0.45 Identities = 41/146 (26%), Positives = 64/156 
(41%) 

Query: 19 YPQRQDHELQALEAIYGADFQDLRPDACGPVKEPP 53 

Y + Q+ E+ L+AIYG DF K P 

Sbjct: 44 YQEVQESEVMVLQAIYGEDFTQHEAAHGAWQKSEP 78 

Query: 54 EINLVLYPQGLTGEEVYVKVDLRVKCPPTYPDVVP 88 

++ + P + +E+ V L V TYP P 
Sbjct: 79 RFDIKIKPS--SDQEL--SVTLGVVMVATYPKTPP 109 

Query: 89 EIELKNAKGLSNESVNLLKSRLEELAKK HCGE 120 

+ +K+ L ES + E K + 

Sbjct: 110 LLTIKDDHSL-RESTKFKIQKFVETQPKIYAQAEQ 143 

Query: 121 VMIFELAYHVQSFLSE--HNK PP PKS 144 

MI ++ ++ L E K P ++ 

Sbjct: 144 EMIDQIVEGIRDILEEAAQKKVQGLEIPSLEEERA 178 

Query: 145 FHEEMLERRAQEEQQR 160 

HE L R AQ E++R 
Sbjct: 179 AHEAELARLAQSEKER 194 SEQ.ID NO: 30 

Score = 49 (22.3 bits), Expect = 0.61, Sum P(2) = 
0.45 Identities = 13/48 (27%), Positives = 27/48 
(56%) 

Query: 1458 EGSHVKVKSFEKERQTEKRVLETELVDHVL 1487 
E ++ EKER+ K++ E++ + VL 
Sbjct: 181 EAELARLAQSEKEREERKKLEESKEEERVL 210 

Query: 1488 QK-LRTKVTDERNGREAS 1504 
+ L+ ++ +RN + S 
Sbjct: 211 EDMLQEELKRQRNKAKES 228 SEQ.ID NO: 31 

Score = 49 (22.3 bits), Expect = 0.61, Sum P(2) = 
0.45 Identities = 13/48 (27%), Positives = 27/48 
(56%) 

Query: 238 KHRANSSGRSRRERQYSVCNSEDSPGSC-EIL 268 
+++A S+RQS + PG EL 
Sbjct: 222 RNKAKESRKKNRSHQLSPDRAPQDPGETDETL 253 

Query: 269 YFNMGSPDQLMVHKGKCIGSDEQLGKLVY 297 
F+ P ++ G + +GK V+ 
Sbjct: 254 MFDQ--PCKITDGSGNALFFQTVIGKTVF 280 SEQ.ID NO: 32 



Score = 47 (21.6 bits), Expect = 0.96, Sum P(2) = 
0.62 Identities = 19/83 (22%), Positives = 33/83 
(39%) 



Query: 750 LPASDSESDIIFDNEDENSKSQ-NQDEDCNEKNGC 783 

L S E ++ D E K Q N+ ++ +KN 
Sbjct: 200 LEE S KEEERVLEDMLQEELKQRQRN YAKES RKKNR 234 

Query: 784 HESEPSVTTEAVHYLYIQMEYCEKSTLRDTIDQGL 818 
H+P + + + ++D L 

Sbjct: 235 HQLSPDRAPQDPGETDETLMFDQPCKITDGSGNAL 269 

Query: 819 YRDTVRLWRLFRE 831 

+ TV +FRE 
Sbjct: 270 FFQTVIGKTVFRE 282 SEQ.ID NO: 33 
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